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Abstract. We investigate the size of subspaces in sumsets and show two main results. 
First, if A C F2 has density at least 1/2 — o{n~ x / 2 ) then A + A contains a subspace of 
co-dimension 1. Secondly, if A C F2 has density at least 1/2 — o(l) then A + A contains 
a subspace of co-dimension o(n). 



1. Introduction 

In the paper [Bou90j Bourgain first addressed the question of showing that if A C 
{1, . . . , N} has positive relative density then A + A contains a very long arithmetic pro- 
gression. Since his work the problem has received considerable attention, and to help 
understand it better Green |Gre05] introduced a model version which has turned out to 
be interesting in its own right. It is this question with which we shall concern ourselves in 
this note. 

Suppose, as we shall throughout, that G := F2 and let fc denote the normalized counting 
measure on G. We are interested in what size of subspace one can guarantee that A + A 
contains, where A is a subset of G of density a := Wg(A). 

It turns out that there are various ranges of the density in which we see quite different 
phenomena. To begin note that if a > 1/2 then the inclusion-exclusion principle tells us 
that Pg(^4 R (x + A)) > for all x G G and so we have that A + A = G; we write this as 
follows. 

Proposition 1.1. Suppose that A C G := F2 has density a > 1/2. Then A + A contains 
a subspace of co- dimension 0. 

Once the density dips below 1/2 things begin to change. In this regime A may be 
contained in a subspace of co-dimension 1, and so A + A can, at best, be guaranteed to 
contain a subspace of co-dimension 1. To start with this is best possible: 

Theorem 1.2. Suppose that A C G := F2 has density a > 1/2 — e where e G (0, l/2 9 y/n\. 
Then A + A contains a subspace of co-dimension 1. 

Once e ~ 1/y/n, however, a different sort of behavior manifests. The worst known such 
is exhibited by the so called Niveau set construction of Ruzsa |Ruz87] and for us this yields 
the following theorem. 

Theorem 1.3 (Green- Ruzsa). For all e G (2 2 / v ^n, 1/2], there is a set A = A(e) C G := 
of density a > 1/2 — e such that any subspace contained in A + A has co-dimension Q(ey/n). 
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We are not able to prove a matching upper bound, although we are able to establish the 
following weak complement. 

Theorem 1.4. Suppose that A C G := F2 has density a > 1/2 — e where e G (0,1/2]. 
Then A + A contains a subspace of co-dimension O( log "„ 1 ). 

All previous work on showing that sumsets contain large subspaces has concentrated 
on the case of positive density (rather than density close to 1/2) and has consequently 
produced weaker results. It turns out the argument used for the previous theorem also 
yields a new result in this case. 

Theorem 1.5. Suppose that A C G := F2 has density a > 0. Then A + A contains a 
subspace of dimension fl(an). 

This improves upon |Gre05t Theorem 9.3] where a lower bound on the dimension of the 
form Q(a 2 n) was given. Recently Croot and Sisask |CS10j also established an improvement 
of roughly the strength of Theorem 11.51 but by different arguments in a much more general 
setting. 

Finally, Fourier analysis is notoriously weak when dealing with thin sets and, indeed, if 
a = o(logn/n) then it turns out that an elementary counting argument of Croot, Ruzsa 
and Schoen [CRS07] supersedes Theorem 11.51 

As indicated we make use of the Fourier transform the group G := F2 • in particular we 
denote the dual group by G and define the transform to be the map taking / e I^iG) to 

/(7) := EzeG f{xYy{x). 

We use basic results from Fourier analysis without comment and the reader interested in 
details may wish to consult Tao and Vu |TV06] . 

he note now splits into three further sections in which Theorem 11.2} Theorem 11.31 and 
Theorem 1 1 . 4 1 ( and 1 1 . 5 p are proved, followed by some concluding remarks in the final section 
including a discussion of the link with the integer version of the problem. 

2. Proof of Theorem 11.21 

The argument involves two main tools. The first is Plunnecke's inequality, |Pfii69] . which 
we record now. One of the key ideas in our work is to make use of it in the region when 
K~l. 

Theorem 2.1 ( |TV061 Corollary 6.28]). Suppose that A, B C G := F2 are such that 
Pg(^4 + B) < KT>g{A). Then for any positive integer k there is a set X C A with 
P G (X + kB) <K k F G (X). 

The second tool is measure concentration on the cube for which we shall follow McDi- 
armid |McD89j . The idea of using measure concentration was introduced to difference set 
problems by Wolf in |WollO] and that is the inspiration for our application. 

For any natural number n we write Q n for the cube {0, l} n . The Hamming metric on 
Q n is defined in the usual way: 

d(x,y) :=\{i:xi^ yi}\ for all x,y G Q n . 
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For any set A C Q n and r > we write 

Ham r (y4) := {x G Q n : d(x,y) < r for some y G A}, 

that is the set of points of at most distance r from A. Measure concentration provides a 
lower bound for the density of this set. 

Theorem 2.2 QMcD89, Proposition 7.7]). Suppose that A is a non-empty subset of Q n . 
Then for any r > we have 

, exp(— r 2 /2n) 

The formal similarity of F2 and Q n immediately tells us how we shall make use of this 
result. Suppose that E = {e%, . . . , e n } is a basis of G := F2, so that the map 

4>e ■ Qn -> G; x h> xi.et H h x n .e n 

is a bijection. Sets in G inherit certain growth properties from those in Q n as follows. 
Writing F := E U {0 G } we have that 

(2.1) A + rF D B (Ham r (0^(A))). 

To see this note that if z is a member of the right hand side then there is some y G A such 
that 

d{<p- E \z)^ E \y))<r. 

Note that this is perfectly well defined since <p E l is a bijection. Now, let x G {0, l} n be 
such that 

otherwise, 



so that 

On the other hand the number of % such that Xj 7^ is at most r and 0g G F, whence 
z G y + rF. (12.11) then follows. In light of this we have the following consequence of 
Theorem O 

Proposition 2.3. Suppose that G = W% and E is a basis of G, A is a non-empty subset 
of G and F := E U {0^}. Then for any r > we have 

The heart of the argument is the following asymmetric version of Theorem II. 21 

Proposition 2.4. Suppose that A C G := F2 density a > 1/2 — e where e G 
(0, 1/2 9 a 1 /?T] . TTien A + A contains a coset of a subspace of co-dimension 1. 
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Proof. In view of Proposition 11.11 we may certainly assume that P G (A) < 1/2. Put S : = 
(A + A) c and note that (S + A) n A = 0, whence P G (A + S) < 1 - P G (A). Since P G (A) = 
a > 1/2 — e, it follows that 

P G (A + 5) < F G (A) < exp(6e) P G (A), 

since e < 1/4. Suppose that E is a basis of G such that there is some s E S with s + i? C S. 
Since P G is translation invariant we have that F := E U {0 G } has 

P G (A + F) = P G (A + (s + F)) < W G (A + S) < exp(6e) P G (A). 

Now, let = \\/n\ and define a sequence of sets X ,Xi, . . . using Plunnecke's inequality: 
let ^ X C A be such that P G (X + kF) < exp(6eA;) P G (X ), and ^ X r+l C X r be 
such that 

(2.2) P G (X r+1 + 2 r+1 kF) < exp(6e.2 r+1 k) P G (X r+1 ). 

Since X r ^ and (X r ) r is nested, we see that the sequence (a/2 log 8 P G (X r )~ 1 ) r is 
increasing and bounded above by 0(-\/n). It follows that there is some r such that 
2 r > a/2 log8P G (X r ) _1 ; let r' be the minimal such. In this case 

T'- 1 < v /21og8P G (X f ,_ 1 )- 1 < a/2 log 8 P^X,.)- 1 
by nesting of (X r ) r , whence 

(2.3) v / 21og8P G (X r 0- 1 < 2 r ' < 2. a/2 log 8 Pc^)" 1 < 2 5 log 3 Pg( ^ - . 
But by Proposition 12.31 we have that 

r /, „v exp(-2 2r 'A; 2 /2n) 
VG(Xr , + 2<kF) + * Vg{ J > >1. 

Now by (12. 2 j) and the upper bound in ( 12. 3ft we get that 

P G (X r , + T'kE) < exp{6e.2 r 'k) P G (X r ,) < - 

since k < 2A/n < 2 _8 e _1 by assumption on e; whence 

exp(-2 2r 'A; 2 /2n) 1 



Pc(X r 



> 



On the other hand this can be bounded above using the lower bound in H 2 . 3 f) and the fact 
that k 2 > n: 

exp(-2 2r 'k 2 /2n) < (8 P G (X r /)- 1 )- fc2 / rt 1 
P G (X r ,) " P G (X r ,) " 8' 
This contradiction means that for all s G S, the set S — s contains at most n — 1 linearly 
independent vectors. Thus there is an element s G S 1 and a subspace i7 of co-dimension 1 
in G such that 5 C s + H . Since S = (A + A) c it follows that A + Ad(s + H) c ; (s + H) c 
is simply the coset of H not equal to s + H, whence we are done. □ 
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We now use the above result in a straightforward manner. 

Proof of Theorem \1.2. Let U be the smallest subspace of G such that A + A D U c . Note 
that such a space exists since A + A D = G c . 

By averaging there is a coset x + U on which v4 has relative density at least a: write 
A' := (A n (x + U) - x) and note that A + A D A' + A' and P l /(A / ) > a. By Proposition 
12.41 there is a subspace U' < U of relative co-dimension at most 1 and some u G U such 
that u + U' C A' + A' C A + A. We have three cases: 

(i) U' = U: then 

G = UUU C = U'UU C CA + A; 

(ii) U' and u + U' ^U': then A + A D U' c and dim [/' < dim Z7 contradicting the 
minimality of i7; 

(iii) U' U and u + U' = U'\ then let 7r : G — > U be a projection which is the identity 
when restricted to U and note that 7r _1 (?7 / ) is a subspace of G of co-dimension 1 
with 

7r~ l (u') = (7r- 1 ([/ , )nt/)u(7r- 1 (t/ / )nt/ c ) 

C U'UU c cA + A. 

The result follows. □ 



3. Proof of Theorem 11.31 

The argument here is a very slight adaptation of |Gre05l Theorem 9.4]. Green established 
this by reformulating Ruzsa's construction from [Ruz91j in the model setting where many 
of the details simplify. 

Proof of Theorem \1.3\ . Let 

A := {x G F2 : x has at most n/2 — rj\ / 27m/2 ones.}. 

Let X be the random variable which takes x G G to the number of Is in x. Fg is the 
uniform distribution on G, and X is the sum of n independent identically distributed 
Bernoulli random variables with parameter p = 1/2. 

The mean of each individual Bernoulli trial is 1/2 and the variance 1 /4 so that the mean 
of X is n/2 and the variance is n/4. It follows from the Berry-Esseen inequality (see, e.g. 
[Shi96l p374]) that 

3 2 

sup I F G (A) ~ $(-77V27r)| < -)=. 



Thus 



On the other hand 



3 2 

P G M) > $(-nV2n) 

'n 



_ 1 I pVzn ] 

$(2L^V2^/2J/V^) = = / exp(— x /2)dx > - 

2 V27T Jo 2 
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It follows that 

, „x 1 3.2 



2 ' ^ 

It follows that for eyfn > 2 2 we may pick r\ = 0.8e and get that Fg(A) > 1/2 — e. 

Now we shall show that if V < G has co-dimension at most d := \j]\/2nn J then V 
contains a vector with at most [r]\ / 27m\ zeros in the standard basis. Since any x G A + A 
has at least r]\ / 27in zeros in the standard basis we shall be done. Such a V can be written 

as 

V = {XiVt H h \ n - d v n ^ d : A t G F 2 }, 

where the t>jS are linearly independent. The UjS may be written in the standard basis as 

(i) W 

where (ej)j is the standard basis. The column rank of the matrix {e^)^ is n — d hence 
so is its row rank. Without loss of generality we may suppose that the first n — d rows 
(e^, . . . , e„_d), j = 1, • • • , n — d are linearly independent. It follows that we can solve the 
n — d equations 

Aie[ j) H h \ n - d e n _ d = 1 

for the A, giving a vector in V with no more than d zeros. The result follows. □ 

4. Proof of Theorem 11.41 

We shall prove the following stronger theorem from which both Theorems 11.41 and 11.51 
follow. 

Theorem 4.1. Suppose that A C G := F2 has density a < 1/2. Then A + A contains a 
subspace of co-dimension 

' n 2 ~ 2cr 
n/log2 T^ 

Before proving this we establish our two consequences. 

Proof of Theorem \1.4\ Apply Theorem 14. II with a = 1/2 — e. □ 

Proof of Theorem I i.5l It is easy to see that 

2 -2a 



<n(l-a/log2) + 0(l). 



Theorem 14. II then tells us that v4 + v4 contains a subspace of dimension at least an/ log 2 — 
0(1). □ 

The proof is inspired by the standard iterative method of Roth introduced in |Rot52] 
and the more commonly cited [Rot53] . which was adapted to finite fields by Meshulam in 
[Mes95j . The following lemma is the driver. 
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Lemma 4.2 (Iteration lemma). Suppose that A C G := F2 ^os density a > and V < G. 
T/ien t/iere is a subspace V <V of relative co-dimension 1 such that 

1 — 2(T 

JPV(F'\(A + A)) < F V (V\(A + A)). 

1 — a 

Proof. For each W G G/V fix some xw G and let Aw be AnW — xw considered as a 
subset of V. Since — 2xw = we have that A w + A w = A R W + A fl W 7 , whence 

WeG/V 

write S 1 := V \ (A + A). In view of the definition of S and our above observation, we get 
from Plancherel's theorem that 

(4.1) o = ( w*w,isW) = E E iw(7)r^(7)- 

Partition V into two sets A/" := {7 G 7 : Is (7) < 0} and 7? := {7 G V" : £5(7) > 0}. Since 
0y G P we have 



\ w ) 2 - 



E E iw(7)i 2 r5(7)>py(^) E p ^ 

On the other hand, from equation (14.11) and the definition of Af, we get 

E E IW(7)| 3 |fs(7)l = E E -IW(7)| 2 fs(7) 

= E E iw(7)i 2 ^(7) 

jev wgg/v 

> F V {S) Vv(Aw) 2 - 
weG/v 

By the Holder's inequality and then Parseval's theorem we have 

E E iw(7)i%(7)i < su P |f 5 (7)iE E iw(7)i 2 

■yeM weG/V jeAf ^0 d weG/V 

= sup 1 1^(7) I Yl (^v(A w )-Fv(A 
^ N wgg/v 

Combining this with the foregoing we get that 
(4.2) Fv(S) Yl ^ A 

w) < sup \ l,s (7)1 E (JPv(A w ) -Fv(A w ] 

WeG/V 7eAr W&G/V 

Now, by the Cauchy-Schwarz inequality 

EweG/vF v (A w ). ^ F V (A W ) < ^ fv(A w ) 2 . 

WeG/V WeG/V 



w) 2 ) 



2\ 



8 TOM SANDERS 

However, Ew&g/v Fv(Aw) = oc, whence 

(Fv(A w ) - Fv(A w ) 2 ) < (a' 1 - 1) Fv ( A 

WeG/V WeG/V 



2 

W) ■ 



Inserting this into (14.21) we get that 



2 



F V (S) Y] Fv{A w f< sup|f 5 ( 7 )|(a- 1 -l) V F V {A W ] 

Since a > and G/V is finite we may divide by the sum and conclude that 

sup|l s ( 7 )| > Fv{S). 

■yeAf I - a 

Let 7 G A/" be such that this supremum is attained and write V := {7}" 1 ". V has relative 
co-dimension 1 and, in view of this, we note that V \ V = Xq + V for any x G V \ V. 
Now 

p y (S n 7') - F V (S n (x + V)) = 15(7) < Pv(S), 

1 — a 

since V = {7}" 1 " and 7 G A/". Furthermore, since V and xo + V partition V we have that 

F V (S n V') + Fv(S n (ac + V')) = Fv{S). 
Adding these two expressions tells us that 

Fv ,(s n n = 2p v (5 n V) = ( 1 - j p v (S) = Pv(S). 

\ 1 — ay 1 — a 

It remains only to note that S (1 V = V \ (A + A) and the lemma is proved. □ 

Proof of Theorem \4.1\ Let o G (0,1) be a parameter to be optimized later. We use the 
iteration lemma to produce a sequence of subspaces Vi such that 

(i) Vi < Vi-i and the co-dimension of V, in V^_i is 1; 

(ii) P^ \(A + A))<^ FvUVi-i \ (A + A)). 

We set Vo = G, and apply the iteration lemma (Lemma 14.21) repeatedly to get the sequence. 
Now, if \Vi \ (A + A)\ < 1 then Vi\ (A + A) = 0, whence A + A contains a subspace of 
co-dimension i by 1$). In view of (JI]) and (Jn|) this certainly happens if 

\V l \FvXV\{A + A))<2 n ' i ( l -f^\ <1; 



1 — a 

taking i minimal such that this inequality is satisfied yields the result. □ 
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5. Concluding remarks 



As noted in the introduction Theorem 11.21 is best possible, however there is still a large 
gap between Theorem 1 1 . 3 1 and Theorem II .41 It has been suggested in |Gre05] that the truth 
is closer to Theorem 11.31 in the case when the density of the set is One might take 

Theorem 11.21 as some support of this conjecture (at least in the case of density 1/2 — o(l)). 
An intermediate question might be the following. 

Question 5.1. Suppose that A C G := F£ has density a > 1/2 — C/y/n. Does A + A 
contain a subspace of co-dimension Oc(l)? 

Theorem 11.31 tells us that this cannot be sublinear, and while showing that may be hard 
it could be that 0(C 2 ) is rather more accessible. 

Theorem 11.41 (and Theorem 11.21) are the first results which provide a sensible upper 
bound on the co-dimension rather than lower bound on the dimension of the subspace 
found in A + A, and given the proof one might imagine an improvement to Theorem 11.41 
would be possible. 

The following is a well-known theorem (see, for example, Metsch [Mct03j). 

Theorem 5.2. Suppose that G := F2 and S C G\{0g} meets every subspace of dimension 
d. Then \S\ > 2 n+1 ~ d - 1. 

This result can be used in the proof of Theorem 14.11 above to tell us that once we have 



in the iteration then A + A must contain a subspace of dimension d. This certainly happens 
when 



again taking % minimal such that this inequality holds we get a saving of 1 in the co- 
dimension. This is, of course, not particularly impressive and, indeed, no naive attack 
along these lines will work as it turns out that Theorem 15.21 is also best possible. (Again, 
see Metsch [Mct03j.) This may be contrasted with the following result of Alon. 

Theorem 5.3 ( |Alo07t Theorem 4.1]). Suppose that G := F% and S C G \ {0 G } is such 
that \S\ < cy/\G\/ log \G\. Then there is a set A C G such that S = (A + A) c . 

Here, of course, the set A produced is rather thin and certainly nowhere near the densities 
we are looking for. 

One may also reasonably ask what happens in the transition to the integers. Theorem 
11.51 can be proved there through the machinery of Bohr sets as developed by Bourgain 
|Bou99] . However, as the reader will have realised from the proof, the strength comes from 
the rather precise subgroup structure which is not present in general and as a result the 
conclusions are weaker than what is already known |Gre02j . 



V l \(A + A)\<2 



,n~i+l—d 
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